# High-fidelity audio synthesis

Bigvgan 22khz 80band
MIT
BigVGAN is a universal neural vocoder achieved through large-scale training, capable of providing high-quality audio output for tasks such as speech synthesis.
Speech Synthesis
B
nvidia
2,344
1
Bigvgan V2 44khz 128band 512x
MIT
BigVGAN is a universal neural vocoder based on large-scale training, capable of generating high-quality audio waveforms.
Audio Generation
B
nvidia
223.13k
41
Bigvgan V2 44khz 128band 256x
MIT
BigVGAN is a large-scale trained universal neural vocoder capable of high-quality conversion from mel-spectrograms to waveform audio.
Speech Synthesis
B
nvidia
367
7
Bigvgan V2 22khz 80band Fmax8k 256x
MIT
BigVGAN is a large-scale trained universal neural vocoder capable of high-quality mel-spectrogram to waveform conversion. The v2 version accelerates inference through custom CUDA kernels and expands training data diversity.
Speech Synthesis
B
nvidia
1,285
1
Bigvgan V2 22khz 80band 256x
MIT
BigVGAN is a general-purpose neural vocoder trained at scale, capable of generating high-quality audio waveforms from mel spectrograms.
Speech Synthesis
B
nvidia
503.23k
16
Bigvgan V2 24khz 100band 256x
MIT
BigVGAN is a high-performance neural vocoder that achieves high-quality audio synthesis through large-scale training, supporting multiple sampling rates and frequency band configurations.
Audio Generation
B
nvidia
34.03k
14
Musicgen Stereo Melody
MusicGen is a text-to-music generation model developed by Meta AI, capable of producing high-quality stereo music samples based on text descriptions or audio prompts.
Audio Generation Transformers
M
facebook
82
10
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase